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APPARATUS FOR RECORDING OR REPRODUCING MULTIMEDIA 
DATA USING HIERARCHICAL INFORMATION STRUCTURE AND 
INFORMATION STORAGE MEDIUM THEREOF 

5 Technical Field 

The present invention relates to recording and reproducing 
multimedia data, and more particularly, to apparatuses to record and/or 
reproduce multimedia data using a hierarchical information structure and 
an information storage medium thereof. 

10 

Background Art 

A multimedia data recording/reproducing apparatus requires 
additional information, such as attributes of multimedia information or a 
sequence of data reproduction, to record the multimedia information on 

15 or to reproduce the multimedia information from a data storage medium. 
FIG. 1 illustrates the conventional multimedia data recording/reproducing 
apparatus 200 including the data storage medium 100 and connected to 
a user output device 300. The multimedia data recording/reproducing 
apparatus 200 may be controlled using a user input device 400, such as 

20 a remote control. In general, in order to record the additional 
information, additional information tables are formed at a particular 
position or in a particular file of the existing data storage medium. 
Examples of the data storage medium include compact discs (CDs), 
video CDs, and digital versatile discs (DVDs). In the additional 

25 information table, information is described by a location and a length of a 
data field. Thus, a new information table needs to be created whenever 
a new type of multimedia information medium is designed. In particular, 
navigation information, to select a reproduction unit or to determine the 
reproduction sequence, is recorded in table formats on the DVD. 

30 However, such navigation information is mixed with the information that 
defines reproduction units, thereby making it difficult to carry out a 
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navigation. 

The data structure of a conventional DVD will now be described. 
Referring to FIG. 2, a DVD includes a video manager (VMG) area and a 
plurality of video title set (VTS) areas. Control information and title 
5 selection information are stored in the VMG area, where the title 

information of a reproduction unit is stored in the plurality of VTS areas. 
The VMG area includes two or three files and each VTS area includes 
three to twelve files. 

The VMG area is illustrated in FIG. 3 in detail. The VMG area 
10 includes a video manager information (VMGI) area to store the additional 
information regarding the VMG area, a video object set (VOBS) area to 
store video object information of a menu, and a VMGI backup area. 
Each of the above areas includes a single file. The VOBS area may or 
may not be included in the VMG area, but the other two areas, the VMGI 
15 area and the VMGI backup area, are required. 

Title information and VOBS are stored in the VTS area. A 
plurality of titles may be recorded in the VTS area. The VTS area is 
illustrated in detail in FIG. 4. 

Referring to FIG. 4, video title set information (VTSI), the VOBS for 
20 menu, which is a video object set for the VTS menu, VOBS for title, 
which is the video object set for the titles in a VTS, and VTSI backup 
data are recorded in VTS area #n. However, the VOBS for the title may 
not be included in VTS area #n. Each VOBS is divided into a video 
object (VOB) on cells, which are recording units. Each VOB includes 
25 the cells. According to an aspect of the present invention, the cell is 
determined to be a lowest-ranked unit of data. 

A reproduction unit is represented by a hierarchical structure in 
which a title is present at a corresponding top level. The title includes 
one program chain (PGC) or a plurality of PGCS linked to one another. 
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Here, a first PGC is called an entry PGC. FIG. 5 illustrates the 
reproduction unit one_sequentiaLPGC_title, including only a single PGC 
(an entry PGC). FIG. 6 illustrates the reproduction unit in which the title 
is linked to the plurality of PGCs. After reproduction of the PGC, 

5 another PGC is reproduced by selecting from at least one of several 
PGCs. In this case, a selection command may be stored in program 
chain information (PGCI). Controlling the sequence of PGC 
reproduction is called the navigation. FIG. 7 illustrates the structure of 
the PGC. The PGC is stored in an information structure described as a 

10 PGCI format The PGCI includes a pre-command in which navigation 
commands are stored, a post-command, and a plurality of program 
information units. The pre-command is carried out before the 
reproduction of a related PGC and the post-command is carried out after 
the reproduction of the PGC. Each program information unit includes a 

is plurality of cell information units, each cell linked to the cell in the VOB, 
which is the recording unit. Each cell included in each reproduction unit 
has a cell command that is carried out after reproduction of the cell. 
Therefore, the PGCI represents a hierarchical reproducing structure of 
the PGC, i.e., the reproduction unit, in which the lowest-ranked 

20 reproduction unit cell is linked to the lowest-ranked record unit cell. 

FIG. 8 illustrates a case of branching a new PGC using command 
navigation information during or after reproduction of the PGC. In other 
words, it is possible to easily control the sequence of the PGC 
reproduction using navigation commands such as LinkPrevPGC, 
25 LinkTopPGC, LinkNextPGC, LinkTailPGC, LinkGoUpPGC, and 

LinkPGCN. That is, the PGC has reproduction units and also navigation 
information. 

A program in the PGC is referenced by a link called a part of title 

(PTT). 
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In the case of the DVD, the above information is stored in a binary 
table format, that is, the information is recorded in table formats where 
the information is recorded within a predetermined bit length on a 
particular position of the table. 

5 FIG. 9 illustrates a data structure of a TT_SRPT information table, 

which is title information in the VMGI. Referring to FIG. 9, the leading 
two bytes of the TT_SRPT information indicates a total number of titles n. 
The next two bytes of the TT_SRPT information is reserved for 
extension information defined in a future standard. The other bytes 
. 10 following the two bytes represent TTJ5RP information that individually 
describes the titles. In particular, a VTN number, designated by a 
related title, and a title number in a related VTS are recorded with 
predetermined bit lengths on certain positions of VTSN and VTS_TTN, 
respectively. 

15 FIG. 10 illustrates a data structure of VTS_PTT_SRPT information 

in the VTSI. Referring to FIG. 10, the VTS_PTT_SRPT information 
includes TTU_SRPs corresponding to a number of the titles of the 
related VTS. The respective TTU_SRPs include information to 
designate one of the PTT_SRPs following the TTlLSRPs. Therefore, a 

20 PTT_SRP between a PTT_SRP, which is designated by a TTU_SRP, 
and a PTT_SRP, which is designated by a next TTU_SRP, form the title. 
That is, a plurality of Part_pf_Title Search Pointers PTT-SRPs are 
present in each title, resulting in a division of the title into several PTTs. 
The PTT may be viewed as a chapter, for easier understanding. FIG. 1 1 

25 illustrates contents of the PTT_SRP, designating the PGC and a program 
in the PGC. In conclusion, the title is divided into several PTTs and 
each PTT is linked to a program in the PGC. 

FIG., 12 illustrates a data structure of a PGCIT VTS_PGCIT, 
representing a PGCI table in the VTSI. Referring to FIG. 12, a total 
30 number of the programs and the cells that belong to the PGC are stored 
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in a VTS_PGCITL Also, the VTS_PGCIT stores VTS_PGCIs as many 
as a plurality of VTS_PGCs, which belong to the VTS. FIG. 13 
illustrates a detailed data structure of the VTS_PGCI. In the VTS_PGCI, 
various information are recorded within particular bit lengths at particular 
5 positions in table formats, including a PGC_CMDT, which describes a 
pre-command, a post-command, a cell command, and so on. Also, the 
VTS_PGCI includes a PGC_PGMAP, which indicates program start cells 
as many as the programs, and a C_POSIT, which is the information for 
linkage of respective cells to respective record units. 

10 As described above, in the case of the conventional DVD, which is 

a multimedia data storage medium, the video object data and the data 
regarding the reproduction units and the navigation are recorded as the 
titles and the PGCs. However, the additional data in the PGCs is stored 
in the table format within particular lengths at a certain position. The 

15 commands to navigate are also stored in a limited space, together with 
the additional data, which define reproduction units. Therefore, the 
advent of a new reproduction unit results in a change of the table 
location; thus, making it difficult to implement an extension for the new 
reproduction. To solve this problem, reserved spaces are formed in a 

20 plurality of regions, which still limits any extension for the future. In the 
worst-case scenario, a table structure may be redefined. In fact, 
existing multimedia data storage media such as the CDs, the VCDs, the 
MDs, and the DVDs have table structures of their own. 

Meanwhile, advancements in Internet technologies trigger 
25 development of personal computers (PCs) capable of processing 
multimedia. For instance, video object data or audio data may be 
inserted into a document with HyperText Markup Language (HTML) or 
may be reproduced during reproduction of HTML. In this case, the 
navigation data is described with a script language or the like. 
30 Therefore, the navigation data may be described separate from 
reproduction data. If there are two types of navigation data, e.g., one 
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controlled using script language and another one described in the table 
format, it is complicated to control both types of navigation data. 

Disclosure of the Invention 

5 According to an aspect of the present invention, there is provided 

a data storage medium and a data reproduction apparatus. Assuming 
that the data reproduction apparatus operates as specified in the present 
invention, a recording apparatus records information, according to the 
data structure of which is specified in the present invention, and 
10 multimedia data on a storage medium. Here, storing operations are 
understood as identical to recording operations. 

Multimedia data and additional data are recorded in a storage 
medium, according to an aspect of the present invention. The present 
invention suggests separate recording of two different types of additional 

is data: additional information regarding record units, attributes, and 
reproduction units of the multimedia data, and navigation information 
regarding selection of a reproduction unit and a reproduction sequence. 
The additional information regarding a record unit, attributes, or a 
reproduction unit, with respect to multimedia data, is described using a 

20 markup language. Accordingly, implementation supporting an extension 
of a future standard is possible even when adding a new type of 
multimedia data or prescribing a new type of recording or reproduction 
unit, regardless of the standard adopted. Alternatively, the additional 
information may be stored in a binary table format. 

25 Also, both or one of a markup language and a script language 

may be used to describe navigation data, which represents selection of a 
reproduction unit or reproduction sequence. A markup language is also 
used to describe presentation data, which represents a menu screen to 
select a reproduction unit and a screen layout for data reproduction, 
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thereby enabling a menu structure and navigation with a high degree of 
flexibility. 

According to an aspect of the present invention, there is provided 
a multimedia data storage medium in which multimedia data is stored. 
5 The multimedia data storage medium includes a first layer in which the 
multimedia data, such as video object images, still images, voice, 
graphics, and texts, is stored; and a second layer in which, when the 
multimedia data is divided into the recording unit and the reproduction 
unit, information regarding attributes of the record unit and relationship 
10 between the record unit and the reproduction unit are described with the 
markup language using elements and the attributes. 

The navigation data, which is used to control a selection of the 
reproduction unit and the reproduction sequence, may be recorded on a 
third layer using the markup language or the script language, in addition 
15 to the information recorded on the first and second layers. 

Additional aspects and/or advantages of the invention will be set 
forth in part in the description which follows and, in part, will be obvious 
from the description, or may be learned by practice of the invention. 



20 Brief Description of the Drawings 

The above and/or other aspects and advantages of the present 
invention will become more apparent by describing in detail a preferred 
embodiment thereof with reference to the attached drawings in which: 

FIG. 1 illustrates a conventional multimedia data 
25 recording/reproducing apparatus; 

FIG. 2 illustrates a data structure of a conventional DVD; 
FIG. 3 illustrates a VMG area; 
FIG. 4 illustrates a VTS area; 

FIG. 5 illustrates a reproduction unit one_sequentiaLPGC_title, 
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including only a single PGC; 

FIG. 6 illustrates the reproduction unit in which a title is linked to a 
plurality of PGCs; 

FIG. 7 illustrates a structure of the PGC; 
5 FIG. 8 illustrates a case of branching a new PGC using command 

navigation information during or after reproduction of the PGC; 

FIG. 9 illustrates a data structure of a TT_SRPT information table; 

FIG. 10 illustrates a data structure of VTS_PTT_SRPT information 
in a VTSI; 

10 FIG. 1 1 illustrates contents of a PTT_SRP, designating the PGC 

and a program in the PGC; 

FIG. 12 illustrates a data structure of a PGCIT VTS__PGC!T, 
representing a PGCI table in the VTSI; 

FIG. 13 illustrates a detailed data structure of a VTS_PGCI; 
15 FIG. 14 illustrates a filel .mpg and a file2.mpg, in accordance with 

an aspect of the present invention; 

FIG. 15 illustrates two video object clips, in accordance with an 
aspect of the present invention; 

FIG. 16 illustrates a position of video object data at a time gap 
20 position recorded in a table format, in accordance with an aspect of the 
present invention; 

FIG. 17 illustrates a video object file, in accordance with an aspect 
of the present invention; 

FIG. 18 illustrates the reproducing apparatus, in accordance with 
25 an aspect of the present invention; and 

FIG. 19 illustrates a method of forming a menu screen for 
navigation, in accordance with an aspect of the present invention. 

Best mode for carrying out the Invention 

30 Reference will now be made in detail to the present aspects of the 

present invention, examples of which are illustrated in the accompanying 
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drawings, wherein like reference numerals refer to the like elements 
throughout. The aspects are described below in order to explain the 
present invention by referring to the figures. 

According to an aspect of the present invention, a storage medium 
5 on which a video object title is recorded, being supported by a plurality of 
voice types and subtitles. Additional information may be hierarchically 
recorded and a markup language is used to implement each layer for 
extensibility. The markup language, which describes record units and 
reproduction, units, is called a media description language. 

10 Referring to FIG. 14, filel .mpg, which is a first half of data 

representing a video object title, and file2.mpg, which is the other half, 
are recorded in a first layer. A title may be divided into a plurality of files 
given a size limit of a chapter unit or a file. Video object data is 
compressively encoded to reduce a data amount thereof. MPEG, which 

15 is one of the most popular motion picture compression methods, 

supports a variable bit rate (VBR) encoding method in which a bit rate 
per hour varies according to an amount of video information. A time 
search determining a starting point of the data reproduction with 
consideration to a given time requires table-type information to map the 

20 time and a position of the data. The information is used to detect the 
position of desired data since a predetermined time after a start of the 
data reproduction. In general, the table-type information includes 
information regarding data positions measured at every predetermined 
point of time. For instance, the table-type information may be time map 

25 information that represents temporal position linking information 

indicating the data positions measured with respect to the file beginning 
every 1 0 seconds. However, because the amount of time map 
information is significant, the information is recorded in a binary table 
format, rather than using the markup language, and stored in the first 

30 layer. FIG. 14 illustrates the time map information file1timemap.dat and 
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file2timemap.dat, regarding the video object data filel .mpg and file2.mpg, 
respectively. According to an aspect of the present invention, actual 
time map information is illustrated in FIG. 16. 

Additional data, regarding multimedia data recorded on the first 
5 layer, is recorded on a second layer. The additional data defines a 
reproduction unit to appropriately reproduce the multimedia data on the 
first layer. Here, the reproduction unit rv&y be divided into record units 
or units of storage, which are described in a binary table format, or in an 
alternative aspect of the present invention, using the markup language, 
10 and stored as a description. xml file. Navigation information, which is to 
be added to the additional data, is recorded on a third layer as a 
menu.xml file. The stored navigation information determines a selection 
and sequences of the data reproduction by controlling the reproduction 
unit recorded on the second layer. 

15 In general, a menu screen is organized in a recording medium, on 

which a plurality of titles or chapters are recorded, to allow random 
access of a particular title or chapter in a user selection and immediate 
reproduction at a specified position. In the menu screen, still images 
and buttons are generally formed. Also, background music may be 

20 reproduced. When a user selects a button, a function associated with 
the button is executed. Referring to FIG. 14, the still images and music 
data included in the menu screen are recorded as file3.jpg and file4.mp3 
files on the first layer. 

The additional data recorded on the second layer describes the 
25 information regarding the data recorded on the first layer. Video object 
data is multiplexed in a data stream to synchronize video, audio, and 
graphic data. The attributes of the record units of the video object data 
are described as description.xml, using the attributes of the video, audio, 
and graphic data, and numbers of audio, video, and graphic data. 
30 Further, the additional data provides information regarding reproduction 
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units that are generated by a combination or a selection of the record 
units. 

In an aspect of the present invention, a 'clip' is used as the record 
unit and 'cells', 'chapters', and 'titles' are used as the reproduction units. 

5 The following description provides explanations on these units (see FIG. 
15): clip: The clip is an object described in relation to the recording of the 
multimedia data. FIG. 1 5 illustrates two video object clips. The video 
object clip has Information about the time and the position. The data 
belonging to the clip can be continuously reproduced. That is, an mpg 

10 file including the video object data and a time map file including the 
temporal position information are combined to form the clip. The time 
map file includes the additional information that enables a quick search 
for a desired temporal position of the video object data when the video 
object is VBR encoded. 

15 That is, if the video object file is the VBR encoded as shown in FIG 

17, the position of the video object data at a time gap position, 
designated by predetermined time intervals from a starting position, is 
recorded in the table format shown in FIG 16. If the data position in the 
table is called an entry, a total number of entries and time gaps may be 

20 recorded at a beginning of the table. A search of a desired position of 
the data, with respect to a predetermined instant of time, can be 
accomplished by detecting the time gap position most proximate to a 
predetermined instant of time recorded in the table. The desired data 
may be precisely reproduced by reading the data starting from the 

25 detected position. If the data is recorded at a constant bit rate (CBR), it 
is possible to maintain the coded data amount generated for a 
predetermined time. In this case, the clip can be constructed with only 
video object data because it is possible to detect the desired position of 
the data using the time calculations according to the CBR without the 

30 time map information. 
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The following information is used to define the video object clip 
including the video data, a plurality of audio data groups, and a plurality 
of graphic data groups: 

(a) general attributes: clip identification information, video object 
5 clip files, and clip temporal position information file (time map file); 

(b) video: screen size (e.g., 1920'1080, 1280720, and 720'480), 
average bit rate (e.g., 4M, 6M, 10M, and 20M), screen output rate (e.g., 
60Hz, 30Hz, and 24Hz), scanning types (e.g., progressive and interlaced 
types); 

10 (c) audio: audio stream identification information, audio encoding 

information, linguistic attributes of each audio data group (e.g., Korean, 
and English), and application attributes of each audio data group (e.g., 
main audio, sub audio, and commentary); and 

(d) graphic: graphic stream identification information, graphic 

15 encoding information, linguistic attributes of each graphic data group 
(e.g., Korean, and English), application attributes of each graphic data 
group (e.g., subtitles, and animation) 

The above information, however, may be changed according to 
the recording type, encoding type, and multiplexing type of the 
20 multimedia data. Also, the record unit may form a hierarchical structure, 
and, thus, the record unit subordinate to the clip may be present. For 
convenience, according to an aspect of the present invention, the record 
unit is determined to be made of clips. 

In general, the reproduction unit has the hierarchical structure, 
25 that is, the reproduction unit includes a plurality of reproduction sub-units. 
In this case, the reproduction sub-unit is defined as a unit of a 
reproduction sequence or a point of random access. 

Referring to FIG. 15, a cell is the reproduction unit that is 
described in relation to the reproduction of the multimedia data, each cell 
30 designating the clip or a portion of the clip. The cell represents a lowest 
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layer of the reproduction unit That is, the cell, i.e., a reproduction unit, 
is linked to at least one clip, i.e., the record unit, in the reproduction of 
the multimedia data. 

The cell is defined using the identification information, and the 
5 starting and ending times of the related clip. However, if the cell 

designates an entire clip, not a portion thereof, the starting and ending 
times are not additionally specified. 

Referring to FIG. 15, a chapter is the reproduction unit described 
in relation to the reproduction of the data, each chapter including at least 
10 one cell. In general, the chapter is defined by a chapter name, chapter 
identification information, and information regarding at least one cell 
belonging to the chapter. 

According to an aspect of the present invention, the chapter can 
be understood as a reproduction point that allows the user to perform the 
15 random access. Thus, within the title, the user can search for or 

reproduce the desired data in units of chapters. In general, the menu 
screen provides a menu in which the desired data can be selected in the 
units of chapters. 

Referring to FIG. 15, a title includes a plurality of chapters. A 
20 plurality of titles may be stored in a storage medium and can be 

sequentially reproduced according to a sequence of title identification 
signs. In general, the menu screen provides a menu that allows 
selection of the title, the title defined by a title name, a title identification 
sign, and information regarding at least one chapter belonging to the title. 

25 FIG. 15 illustrates a relationship between video object data 

recorded on the first layer, and clips, chapters, and titles recorded on the 
second layer. 
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According to an aspect of the present invention, the information 
regarding the second layer is described in two types of units, i.e., the 
record units and the reproduction units, while the data regarding the 
navigation is eliminated. In general, the data is described in a binary 

5 table format or through the markup language. In an aspect of the 
present invention, the data is described using the markup language 
because the markup language is more advantageous than the binary 
table. Extensible markup language (XML), a representative example of 
the markup language, is defined in W3C, which prescribes 

10 recommendations for the Internet. With XML, it is possible to describe a 
variety of databases and documents. When information regarding the 
second layer is described with XML, it is very easy to ensure extendibility 
and backward compatibility. 

Hereinafter, the XML-based language will be defined describing 
15 the additional information to be stored in the second layer, according to 
an aspect of the present invention. The XML-based language is 
referred to as media description language (MDL). XML is described with 
a combination of hierarchical elements. Also, each element may have 
several attributes. The name of the element is described within a sign 
20 '<>'. The regulations to describe a sub element are mentioned below. 

The MDL, according to an aspect of the present invention, has 
elements and attributes as described below. Here, a document is 
understood as a unit of data that is described and stored utilizing the 
markup language. 

25 (a) The uppermost element of the MDL document is described using 
<mdl> or an equivalent value. 

(b) The element <mdl> may have the following sub-elements: 

- <head> 

- <body> 

30 (c) An element <head> contains all information regarding a storage unit 
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and may have the following sub-element: 

- <meta> 

(d) An element <meta> defines a blank element in which features of a 
document are defined and appropriate values are allocated to the 
5 features. Each <meta> element denotes a pair including an attribute 
and a value. 

The attributes of elements defined in the element <meta> are as 
follows: 

- Name: A name denotes a document feature defined in the element 
10 <meta>. The name is an attribute indispensable to the element 

<meta>. 

- Content: A content denotes a feature value defined in the element 
<meta>. The content is also an attribute indispensable to the element 
<meta>. 

15 The following are examples of the element <meta> excluding 

conventional examples of the element <meta>: e.g., (i) <meta 
name-type" content="mdl-disc" />, which describes a disc that is 
manufactured using the media description language; and (ii) <meta 
name-'region" content- 1 !" />, which indicates that the regional code of 

20 the disc is 1. 

(e) The element <body> may have the following sub-elements: 

- <clip>, and 

- <title> 

(f) The element <clip> may have the following attributes: 

25 - Id: representing the identification information for each clip. Id varies 
according to the type of storage medium (e.g., 1 , 2, or 3), 

- src: representing a video object data file such as 'filel .mpg\ and 

- tmap_src: representing a file containing a time map table (e.g., 
file1timemap.dat) 
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The element <clip> may have the following sub-elements: 

- <vldeo>, 

- <audio>, and 

- <graphio 

5 (g) The element <video> may have the following attributes: 

- resolution: image sizes in vertical and horizontal directions, e.g., 
1920'1080, or 1280720, 

- frame_rate: a number of frames output per second, e.g., 60, 30, 24, 
or 50, 

10 - scanning: indicating whether an image is sequentially scanned or not, 
i.e., whether the image is progressively scanned or interlaced scanned, 

- bit_rate: average bit rate, e.g., vbr, 4m, 6m, 8m, 10m, or 20m, and 

- stream_id: stream id of an MPEG PES stream, e.g., OxeO 

(h) The element <audio> may have the following attributes: 

15 - encoding: representing an encoding method, e.g., mp1, mp2, mp3, 
ac3, Ipcm, or dts, 

- sampling_rate: sampling rate, e.g., 48k, 96k, or 192k, 

- quantization_bit: a number of quantized bits, e.g., 1 6, or 24, 

- bit_rate: an encoded bit rate, e.g., vbr, 128k, or 384k, 
20 - channel_no: a total number of channels, e.g., 2, 5, or 7, 

- language: linguistic attributes, e.g., none, en, ko, jp, orfr, 

- application: usage of an audio stream, e.g., main, sub, or 
commentary, 

- stream_id: stream ID of an MPEG PES stream, e.g., OxcO, or Oxbd, 
25 and 

- sub_stream_id: sub stream ID of an MPEG PES stream, e.g., none, 
0x80, or OxaO. 

(i) The element <graphic> may have the following attributes: 

- encoding: representing the encoding method, e.g., dvd_subpicture, 
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- application: usage of s-graphic stream, e.g., animation or sub_title, 

- language: linguistic attributes, e.g., none, en, ko, jp, orfr, 

- strearnjd: Stream ID on an MPEG PES stream, e.g., OxBD, and 

- sub_stream_id: Sub stream ID on an MPEG PES stream, e.g., 0x20. 

5 

(k) The element <title> may have the following attributes: 

- name: title name, e.g., White Snow, 

- Id: title identification information that varies according to the type of 
storage medium, e.g., 1 , 2, or 3. 

io Also, the element <title> may have the following sub elements: 

- <chapter>. 

(I) The element <chapter> may have the following attributes: 

- name: chapter name, e.g., Dwarf, and 

15 - Id: chapter identification information that varies according to a title, 
e.g., 1, 2, or 3. 

Also, the element <chapter> may have the following sub-element: 

- <cell>. 



20 (m) The element <cell> may have the following attributes: 

- clip_id: identification number of a clip to which the cell is linked, 

- start_time: starting time in a clip clip_id, and 

- end_time: ending time in a clip clip_id. 

A summary of the above attributes and elements is shown in Table 

25 1. 

Table 1 



Element 


Attribute 


Sub Element 


<mdl> 




<head>/<body> 


<head> 




<meta>* 
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<meta> 


name, content 


EMPTY 


<body> 




<clip>+/<title>+ 


<clip> 


id ^xc\ tm^in qto 

IU, Ol O, UIIGIL/ Ol U 


^viueo^/ , ^auQio^ > /^grapmc^ 


<video> 


resolution, frame_rate, 
scanning, bit rate, stream id 


EMPTY 


<audio> 


pnpoH inn Qamnlinn roto 
ouuuuiiiy, oclilipilliy idlC) 

nuanti7fltinn hit hit rata 
channel nn lanrmania 
application, stream Jd, 
sub stream id 


PM PTV 


<graphic> 


encoding, application, 
language, strearnjd, 
sub stream id 


EMPTY 


<title> 


name, id 


<chapter>+ 


<chapter> 


name, id 


<cell>+ 


<cell> 


clip id, start time, end time 


EMPTY 



The additional data of FIG. 15, according to a preferred 
embodiment of the present invention, may be described using the above 
elements and attributes, as follows: 

5 

<?xml version="1.0"?> 
<!DOCTYPE mdl 

PUBLIC "http://mdl-disc.org/mdl1/mdl.dtd"> 

10 <mdl> 
<head> 

<meta name="type" content="mdl-disc" /> 
< meta name="title" content="White Snow Special" /> 
<meta name="region" content="1 " /> 
15 </head> 

<body> 

<clip id="1" src="file1 .mpg" tmap_src="file1tmap.dat"> 

<video resolution="1 920x1 080" frame_rate="24" 
20 scanning="progressive" bit_rate="vbr" 

stream_id="0xe0" /> 

<audio encoding="ac3" sampling_rate="48k" 
quantization_bit="1 6" 

bit_rate="384k" channel_no="5" language="en" 

25 

application="main" stream_id="0xbd" sub_stream_id="0x80" /> 
<audio encoding="ac3" sampling_rate="48k" 
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quantization_bit="24" 

bit_rate="384k" channel_no="5" language="ko" 

application="main" 
stream_id="0xbd" sub_stream_id="0x81" /> 
5 <audio encoding="ac3" sampling_rate="48k" 

quantization_bit="24" 

bit_rate="384k" channel_no="5 n language="jp" 

application="main" stream_id="0xbd" sub_stream_id="0x84" /> 
10 <audio encoding="mp1" sampling_rate="48k" 

quantization_bit="1 6" 

bit_rate="384k" channel_no="2" language="en" 

application="commentary" stream_id="0xc0" 
15 sub_stream_id="none" /> 

<graphic encoding="dvd_subpicture" 
application="sub_title" 

language="en" stream_id="Oxbd" sub_stream_id="0x20" /> 
<graphic encoding="dvd_subpicture" 
20 application="sub_title" 

language="ko" stream_id="Oxbd" sub_stream_id="0x21 " /> 
<graphic encoding="dvd_subpicture" 
application="sub_title" 

language="jp" stream_id="Oxbd" sub_stream_id="0x22" /> 
25 </clip> 

<clip id="5" src="file5.mpg" tmap_src="filek5map.dat"> 
<video resolution="1 920x1 080" frame_rate="24" 

scanning-'progressive" 

30 bit_rate="vbr" stream_id="0xe0" /> 

<audio encoding="ac3" sampling_rate="48k" 
quantization_bit="16 n 

bit_rate="384k" channel_no="5" language="en" 

35 application="main" stream_id="0xbd" sub_stream_id="0x80" /> 
<audio encoding="mp1" sampling_rate="48k" 
quantization_bit="1 6" 

bit_rate="384k" channel_no="2" language="en" 

40 application-'commentary" stream_id="0xbd" 

sub_stream_id="0x83" /> 

</clip> 

<tit!e id="1" name="White Snow"> 
45 <chapter id="1" name="Mirror"> 

<cell clip_id="1" start_time="0:00:00" 
end_time="0:05:007> 
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</chapter> 

<chapter id="2" name="7 Dwarves"> 

<cell clip_id="1" start_time="0:05:00" 
end_time="0:20:007> 
5 <cell clip_id="2" start_time="0:00:00" 

end_time= n 1:00:00'7> 

</chapter> 

</title> 

10 <title id= n 2" name="White Snow Supplementary^ 

<chapter id="1" name="How to make Snow White"> 
<cell clip_id="5" start_time="0:00:00 n 
end_time="0:20:007> 

</chapter> 

15 </title> 
</body> 
</mdl> 



In an aspect of the present invention, actual video object data 
20 includes five clips represented in the record units, and two titles 
represented in the reproduction units. 

In the case of the VBR video object data, the clip is described as 
time position information data, which includes a portion of time map 
information. Thus, in an aspect according to the present invention, each 
25 clip is described using the two attributes src and tmap_src, such that the 
clip data and the titles are linked to each other. In addition, each clip 
includes many attributes of video, audio, and graphic data, and is 
referred to prior to the data reproduction. 

Title 1 is a subject title including a plurality of chapters. The first 
30 chapter includes a cell linked to a portion of a clip #1 . Thus, only the 
portion of the clip #1 is reproduced during the reproduction of the first 
chapter. The second chapter includes two cells. The first cell 
designates the reproduction of the data after time 0:05:00 of the clip #1 . 
Accordingly, in order to start the reproduction from the second chapter, 
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the position of desired data must be detected using the time map 
information, specifically, the time position information regarding clip #1 . 

As shown in a corresponding title name attribute, title 2 includes 
an additional image, i.e., a supplementary image. The title 2 is recorded 
5 as clip #5. 

As mentioned above, improved extensibility can be achieved by 
describing the information regarding the second layer using the markup 
language. In other words, an information structure of a new concept 
can be described using new elements or attributes. Then, the existing 

io reproducing apparatuses can reproduce the data using the existing 
information rather than newly generated information. That is, it is 
possible to maintain the reproduction of the data regarding the existing 
element using a conventional reproducing apparatus. Assuming that a 
new element <bookmark> is added to the <title> element and the 

15 following information is recorded on the second layer: 

<title id="1"... > 

<bookmark point="1 :00:00" /> 

<chapter id="1"..> 

20 

</title> 

The element <bookmark> is an information structure that is newly 
defined to allow a direct access to a particular position of the title. 
25 When the storage medium, on which the element <bookrnark> is 

recorded, is inserted into the conventional reproducing apparatus, it is 
possible to reproduce the data recorded on the storage medium using 
the title and the chapter information. However, the bookmark 
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information is ignored because the bookmark information cannot be 
reproduced using the conventional reproducing apparatus. 

As mentioned above, the multimedia data can be reproduced 
using only the data recorded on the first and second layers. In this case, 
5 the reproducing apparatus reads the information recorded on the second 
layer, and determines the format of recorded data recorded, the title, and 
the number of chapters belonging to each title. Next, the reproducing 
apparatus informs the user of the detection result through an appropriate 
user interface, receives a user input, and reproduces the desired 

10 reproduction unit. The user interface includes a user output device 300 
and a user input device 400 as shown in FIG. 1 . In detail, the user 
output device 300 is an apparatus, such as a television (TV), that outputs 
multimedia data, and the user input device 400 is an apparatus, such as 
a remote control, that receives a user input. The user interface, 

is according to an aspect of the present invention, includes a chapter menu 
in a menu screen, which allows selection of the chapters belonging to 
each title. The user selects a title number or a chapter number from 
each menu using the remote control 400, resulting in the detection and 
the reproduction of a desired reproduction unit. 

20 The storage medium 1 00, according to an aspect of the present 

invention, further includes navigation information recorded on a third 
layer. The navigation, which is similar to a conventional menu, includes 
a selection using the user input device 400, such as the remote control 
and reproduction of the reproduction unit 200 to a user output device, 

25 such as a TV. Also, the navigation may include the control of the 

following data reproduction using a current state of the data reproduction. 
Here, the recording apparatus is as shown in FIG. 1. Also, the 
sequences of the data reproduction may be determined differently. For 
instance, a parental level control may be performed during the data 
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reproduction, that is, the sequences of the data reproduction may be 
determined depending on whether the user is an adult or a juvenile. 

To this end, the reproducing apparatus 200 may be manufactured 
as show in FIG. 18. Referring to FIG. 18, a playback engine denotes a 

5 block to process the data recorded on the first and second layers. If the 
storage medium does not contain the data in the third layer, a 
presentation and navigation engine of FIG. 18 converts the user input 
into an Application Program Interface (API), which can be recognized by 
the playback engine, and provides the API to the playback engine. If the 

10 user input is key input and recognized by the playback engine, the user 
input is directly transmitted to the playback engine. 

The data recorded on the third layer includes presentation data, 
which is used to arrange the menu screen or the reproduction unit in the 
screen, and the navigation data, which is used to select the reproduction 

15 unit in response to the user input or the control data reproduction, 

according to a state of a certain playback engine. The presentation data, 
according to an aspect of the present invention, is described using 
HyperText Markup Language (html) or Extensible HyperText Markup 
Language (xhtml). Also, the navigation data may be described with a 

20 Script language or a markup language capable of describing timing and 
synchronizing. A typical example of the Script language is Java script 
that is interpreted and executed in units of lines, and a typical example of 
the markup language having timing and sync definitions is synchronized 
markup interface language (SMIL). 

25 The navigation engine performs navigation by controlling the 

reproduction unit recorded on the second layer, according to the user's 
selection or an event generated by the playback engine. 

The following describes methods of laying out reproduction units, 
recorded on the second layer, in a screen using XHTML and JavaScript, 
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and controlling navigation, according to an aspect of the present 
invention. 

An example of controlling navigation with a markup document, 
which allows the user input key to be directly transmitted to the playback 
5 engine, is as follows: 

- Markup Document 1 

<?xml version="1.0" encoding="UTF-8"?> 
<!DOCTYPE html 
10 PUBLIC "-//W3C//DTD XHTML 1 .0 Strict//EN"> 

<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en"> 
<head> 
<title>White Snow</title> 
<meta name="Default_audio" content="0x80" /> 
15 <meta name="Default_subtitle" content="0x20" /> 

<script type= n text/javascript"> 
function RcKeyEventHandler(evt) 

{ 

if(evt.keyCode == 10) 
20 MDLvideo.lnputRCKey(10); 
if(evt.keyCode==11) 

MDLvideo.lnputRCKey(1 1 ); 
if(evt.keyCode ==12) 

MDLvideo.lnputRCKey(1 2); 
25 if(evt.keyCode == 13) 

MDLvideo.lnputRCKey(13); 
if(evt.keyCode == 14) 

MDLvideo.lnputRCKey(14); 
if(evt.keyCode == 15) 
30 MDLvideo.lnputRCKey(15); 
if(evt.keyCode == 16) 
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MDLvideo.lnputRCKey(1 6); 
if(evt.keyCode == 17) 

MDLvideo.lnputRCKey(1 7); 
if(evt.keyCode == 18) 
5 MDLvideo.lnputRCKey(18); 

} 

</script> 

<script type="text/javascript"> 
function setupEventHandler() 
10 { 

bodyNode.addEventListener("rckeypress",RcKeyEventHandler,true); 
} 

</script> 

</head> 

15 

<body id="bodyNode" onload= n setupEventHandler()"> 
<object data="MDLvideo;" /> 
</body> 
</html> 

20 

A key event interface of a user input device is as follows: 
interface RCKeyEvent: UlEvent { 

const unsigned long VK_0 = 0; // keyCode is "0", 

keyName is "0" 

25 const unsigned long VK_1 = 1 ; // "1" 

const unsigned long VK_2 = 2; // "2" 

const unsigned long VK_3 = 3; // "3" 

const unsigned long VK_4 = 4; // "4" 

const unsigned long VK_5 = 5; // "5" 

30 const unsigned long VK_6 = 6; // "6" 

const unsigned long VK_7 = 7; // "7" 
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const unsigned long VK_8 = 8; // "8" 

const unsigned long VK_9 = 9; // "9" 

const unsigned long VK_PLAY_PAUSE = 1 0 

const unsigned long VK_STOP = 1 1 

5 const unsigned long VK_FF = 12 

const unsigned long VK_FR =13 

const unsigned long VK_SKI P_PREV = 1 4 

const unsigned long VK_SKI P_N EXT = 1 5 

const unsigned long VK_SUBTITLE = 1 6 

10 const unsigned long VK_AUDIO =17 

const unsigned long VK_MENU =18 

const unsigned long VK_UP =19 

const unsigned long VK_LEFT =20 

const unsigned long VK_RIGHT = 21 

is const unsigned long VK_DOWN = 22 

const unsigned long VK_OK = 23 

const unsigned long VK_RETURN =24 

const unsigned long VKJEXIT = 25 

readonly attribute unsigned long keyCode; 
20 readonly attribute DOMString keyName; 

void initRCKeyEvent (in DOMString typeArg, 

in boolean canBubbleArg, 

in boolean cancelableArg, 

in unsigned long keyCode, 
25 in unsigned long keyName); 

} 

The above examples of the markup document reveals that 
"Markup Document 1" primarily includes layout information, which is 
related to a video object displayed by the markup document, and a script, 
30 which is used to control the reproduction data recorded on the second 
layer in response to the user input. Also, the key event interface of the 



// "PlayPause" 
// "Stop" 
// "FastForward" 
// "FastRewind" 
// "Prev" 
// "Next" 
// "Subtitle" 
// "Audio" 
// "Menu" 
// "Up" 
// "Left" 
// "Right" 
// "Down" 
// "OK" 
// "Return" 
// "Exit" 



26 



WO 2004/025452 PCT/KR2003/001877 



user input device defines an interface that allows key values, used in the 
user input device, to be used in a document object model (DOM). 

First, the above example of the markup document includes 
sequentially a declaration that enables the use of the XHTML and an 
5 element <head>, in which the element <title>, the element <meta>, and 
the element <script> are included. In general, the element <title> 
represents the title subject of the markup document, and the element 
<meta> indicates default audio and subtitles in the video object, which is 
reproduced in the markup document. 

10 Also, event registration information according to the user input is 

described using JavaScript language, as follows: 

<script type="text/javascript"> 
function setupEventHandler() 

{ 

15 bodyNode.addEventListener("rckeypress",RcKeyEventHandler,true); 
} 

</script> 

The event registration information indicates that a function 
RcKeyEventHandler is called when an event rckeypress occurs, i.e., the 
20 user presses a key of the user input device 400. 

Also, event processing information is described using the 
JavaScript language as follows: 

<script type^'text/javascript'^ 

function RcKeyEventHandler(evt) 

25 { 

if(evt.keyCode== 10) 

MDLvideo.lnputRCKey(10); 
if(evt.keyCode ==11) 

MDLvideo.lnputRCKey(1 1 ); 
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if(evt.keyCode ==12) 

MDLvideo.lnputRCKey(12); 
if(evt.keyCode == 13) 

MDLvideo.lnputRCKey(13); 
5 if(evt.keyCode == 14) 

MDLvideo.lnputRCKey(14); 
if(evt.keyCode == 15) 

MDLvideo.lnputRCKey(1 5); 
if(evt.keyCode == 16) 
10 MDLvideo.lnputRCKey(16); 
if(evt.keyCode == 17) 

MDLvideo.lnputRCKey(1 7); 
if(evt.keyCode == 18) 

MDLvideo.lnputRCKey(1 8); 

15 } 

</script> 

The event processing information indicates that MDLvideo. 
InputRCKey(IO) is executed when a key code RcKeyEventHandler is 10. 
Here, the object MDLvideo performs data reproduction using the 

20 additional data stored in the second layer, that is, the object MDLvideo 
corresponds to the playback engine. In other words, if a code allocated 
to a key is 1 0 and the user presses the key, the presentation and 
navigation engine of FIG. 18 sends a control command to the playback 
engine using a command API MDLvideo.lnputRCKey. In response to 

25 the control command, the playback engine performs the reproduction 
control operation allocated to the key code 10, i.e., reproduces or 
temporarily stops the reproduction of a video object. 

The object MDLvideo is embedded in the element <body> of the 
markup document using the element <object>. Meanwhile, it is possible 
30 to embed a plurality of objects in the element <body> of the markup 
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document In this case, a layout of the markup document may use 
cascading style sheet (CSS). 

Next, a method of providing a specific functional command to the 
playback engine and performing navigation will be described. FIG. 19 
5 illustrates a method of forming the menu screen for the navigation. 

Image and text data are described as presentation data to display 
on the screen. In accordance with an aspect of the present invention, 
the screen may include the text data described with XHTML, or image 
data recorded on the first layer. Referring to FIG. 19, four buttons, i.e., 

10 title 1 , title 2, title 3, and return, are displayed on the screen. The image 
or the text data forming the four buttons is described using XHTML. In 
the screen, the user can select and press one of the buttons. More 
specifically, the user applies directional keys of the user input device to 
select one of the buttons and applies an OK key to press the selected 

15 button. Alternatively, the screen may be constructed such that a certain 
operation may be executed when the user presses a certain button using 
the keys of the user input device as access keys. When the user 
selects one of the buttons, the selected button is highlighted. The 
above functions are provided by a presentation module of the 

20 presentation and navigation engine. In accordance with an aspect of 
the present invention, the presentation module may be an XHTML 
browser. If the user presses one button, the operation connected to the 
button is performed. 

Likewise, a command may be provided to the playback engine to 
25 reproduce multimedia data using the data recorded on the first and 
second layers. That is, the presentation and navigation engine may 
provide the playback engine with the command for controlling the 
reproduction unit recorded on the second layer. 

- Markup Document 2 
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<?xml version="1.0" encoding="UTF-8 n ?> 
<!DOCTYPE html 

PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"> 
5 <html xmlns="http://www.w3.org/1 999/xhtml" xml:lang="en" lang="en"> 
<head> 
<title>White Snow</title> 
<meta name="Default_audio" content="0x80" /> 
<meta name="DefauIt_subtitle" content="0x20" /> 
10 </head> 
<body> 
<table> 
<tr> 

<td><img src="file1 jpg" alt="title1 7></td> 
15 <td><button type="button" name="title 1 " 

onclick="MDLvideo.playTitle(1)7x/td> 
</tr> 
<tr> 

<td><img src= ,, flle2.jpg" alt="title27></td> 
20 <td><button type="button" name="title 2" 

oncHck="MDLvideo.playTitle(2)7></td> 
</tr> 
<tr> 

<td><img src="file3.jpg" alt="title37></td> 
25 <td><button type="button" name="title 3" 

onclick="MDLvideo.playTitle(3)7></td> 
</tr> 
<tr> 

<td><button type="button" name="Return" 
30 onclick="MDLvideo.play()7></td> 
</tr> 
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</body> 
</html> 

The Markup Document 2 represents the menu screen illustrated in 
FIG. 19. Referring to the Markup Document 2, commands to control the 

5 reproduction units recorded on the second layer are described in an 
'onclick' event of each button. When a button to reproduce title 1 is 
clicked, a command MDLvideo.playTitle(l) is sent to the navigation 
engine. Then, the navigation engine provides the commands and 
parameters to the playback engine for reproduction of corresponding 

10 data. For instance, playTitle is a title reproduction parameter that 

indicates title number. A function to provide such a command is called a 
method. 

Examples of the method are as follows: 

method for an object MDLvideo { 
15 playTitle(title_jio): reproduce data starting from a title indicated in 
title_no. 

playTime(time): reproduce data starting from indicated time. 
pauseQ: pause data reproduction 
stop(): stop data reproduction 
20 play(): start data reproduction 

forward_scan(speed): high-speed forward data reproduction at 
a speed indicated in (speed) 

backward_scan (speed): high-speed backward data 
reproduction at a speed indicated in (speed) 
25 playChapter(chapterjio): when chapterjio == 0, move to 

chapter having the largest chapter ID and reproduce data of the chapter. 

If chapterjio > Maximum chapter ID, move to a chapter having the 
smallest chapter ID and reproduce data of the chapter. 

} 
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If specific conditions required by the playback engine are satisfied, 
an event to process the data recorded on the third layer is generated in 
the presentation and navigation engine. For instance, whenever a 
chapter begins, the event is generated and provided to the presentation 
5 and navigation engine, and then, the presentation engine displays a start 
of the chapter in the screen. The information regarding the event that is 
provided to the presentation and navigation engine and registered to the 
playback engine may also be recorded on the third layer. 

- Markup Document 3 
10 <smil> 

<head> 

<meta name="title" content-'White Snow'7> 
<meta name="Default_audio" content="0x80" /> 
<meta name= n Default_subtitle" content="0x20" /> 
15 <layout> 

<root-layout width="1920" height="10807> 
<region id="img_scene1" top="45" left="30" width="600" 
height="3007> 

<region id="img_scene2" top="390" left= n 30" width="600" 
20 height="3007> 

<region id="img_scene3" top="735" left="30 M width="600" 
height="3007> 

<region id="img_title_button1" top="95" left="660" width="600" 
height="2007> 

25 <region id="img_title_button2 n top="440" left= ,, 660" width="600" 

height="2007> 

<region id="img_title_button3 n top="785" left="660" width="600" 
height="2007> 

<region id^'img^eturn" top= n 885" left^^Cr width="600" 
30 height="1507> 
</layout> 
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</head> 
<body> 
<par> 

<img src="file1 .jpg" region="img_scene1 7> 
5 <img src="file2.jpg" region="img_scene27> 

<img src="f ile3.jpg" region="img_scene37> 
<img src="file4.jpg?url=command:MDLvideo.playTitle(1 )" 
region="img_title_button1 7> 

<img src="file5.jpg?url=command:MDLvideo.playTitle(2) ,, 
10 region="img_title_button27> 

<img src="file6.jpg?url=command:MDLvideo.playTitle(3)" 
region="img_titIe_button37> 

<img src= ,, file7.jpg?url=command:MDLvideo.play()" 
region="img_return7> 
15 </par> 
</body> 
</smil> 

The Markup Document 3 represents the data that is stored in the 
third layer and described using SMIL, which is the markup language with 

20 timing and synchronization functions. SMIL is also largely divided into 
the elements <head> and <body>. In the Markup Document 3, the 
element "head" includes elements "meta" and "layout". The element 
"meta" has already been described with respect to the above markup 
documents and its description will be omitted here. The element 

25 "layout" has children elements "root-layout" and "region". The element 
"root-layout" describes a size and a background color of a document to 
be displayed. The element "region" describes the layout of a SMIL 
document region where each media clip is formed, and an ID of the 
respective region. 

30 The element "body" includes the following elements, which 

describes media clip sources: 
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onimation /> - Shockwave Flash File (.swf) 

<audio /> - Audio Clip File (.rm) 

<img /> - Image Clip File (.jpg.gif) / Not Use Animation GIF 

<ref /> - Document of Different Use (.htm.rp ) 

5 <text ....../>- Text (.txt) 

<textstream..../> - Text Stream (.rt) 
<video /> - Video Clip (.rm) 

In the Markup Document 3, the element "img" is used to create 
the menu screen shown in FIG. 19. The element "img" can be used to 

io link a document to a desired image or insert a command into the 

document. Also, Markup Document 3 may further include elements "a" 
and "anchor" for use as a hyperlink. In Markup Documents, all images 
are linked except three images. In SMIL, a reproduction control attribute 
"url" in the element "img" may have the following attribute values, 

15 including playTitle: 

url :: hyperlink URL. 

command:MDLvideo.playTitle(titIe_no): reproduce data starting from a 
title indicated in tit!e_no. 

command:MDLvideo.playTime(time): reproduce data starting from time 
20 indicated in (time). 

command :MDLvideo.play(): start data reproduction. 

command:MDLvideo.stop(): stop data reproduction. 

command:MDLvideo.pause(): pause data reproduction. 

command:MDLvideo.forward_scan(speed): high-speed forward 
25 reproduction at a speed specified in (speed). 

command:MDLvideo.backward_scan(speed): high-speed backward 

reproduction at a speed specified in (speed). 

command:MDLvideo.pIayChapter(chapter_no): if chapter_no == 0, move 
to a chapter having the largest chapter ID and reproduce data of the 
30 chapter. 

If chapter_no > Maximum chapter ID, move to a chapter having the 
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smallest chapter ID and reproduce data of the chapter. 

Industrial Applicability 

As described above, according to a storage medium of the 
5 present invention, information regarding reproduction of multimedia data 
is divided into record units and reproduction units, and additional 
information regarding the record units and reproduction units are 
recorded on a second layer. Also, information regarding selection and 
navigation of a desired reproduction unit is recorded on a third layer. In 
10 this way, the operation of each layer can be distinguished. In the 

second layer, the data is recorded using a markup language to improve 
extensibility. In the third layer, the data is also recorded using the 
markup language to create additional data, representing a menu screen 
or the layout of a reproduction unit. The data regarding selection of the 
15 reproduction unit and the reproduction sequence is described using a 
script language or the markup language with timing and synchronization 
functions. 

In order to store the multimedia data in the storage medium, a 
multimedia data recording apparatus, according to an aspect of the 

20 present invention, records the multimedia data on the first layer of the 
storage medium, divides the additional information of the multimedia 
recorded on the first layer into the record units and the reproduction units, 
and records the divided additional information on the second layer in a 
table format or by using the markup language. Here, the record units 

25 and the reproduction units may have multiple hierarchical structures. 
According to an aspect of the present invention, the record unit is a clip 
that is made by linking time information to position information when 
video object data is recorded at VBR, and the reproduction units are cells 
linked to the clip or a portion of the clip, chapters linked to a plurality of 

30 cells, and titles linked to a plurality of chapters. 
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In addition, navigation data regarding selection of the reproduction 
unit or the reproduction sequence is recorded on the third layer. The 
navigation data is described using either the script language executed in 
units of lines, or the markup language with timing and synchronization 
5 functions. Also, in the third layer, presentation data, representing a 
menu screen to select the reproduction unit or the reproduction 
sequence, is described using the markup language. 

A storage medium, according to an aspect of the present invention, 
includes the multimedia data and the additional data. The additional 

10 data includes two different types of data: the additional information, 

which includes the record unit, attributes, and the reproduction unit of the 
multimedia data, and the navigation information, which relates to a 
selection of the reproduction unit and the reproduction sequence. The 
additional information is described using the markup language, thereby 

15 enabling an addition of new multimedia data regardless of an extension 
of a standard. Also, even if a new record unit or reproduction is 
prescribed, it is easy to support implementation of the extension. 
Further, both or one of the markup language and the script language is 
used to describe navigation data, which represents selection of the 

20 reproduction unit or the reproduction sequence. The markup language 
is also used to describe presentation data, which represents the menu 
screen to select the reproduction unit and a screen layout for the data 
reproduction, thereby enabling a menu structure and navigation with a 
high degree of flexibility. 

25 While this invention has been particularly shown and described 

with reference to aspects thereof, it will be understood by those skilled in 
the art that various changes in form and details may be made therein 
without departing from the spirit and scope of the invention as defined by 
the appended claims. 
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What is claimed is : 

1 . A multimedia data storage medium, comprising: 
a first layer storing multimedia data; and 

a second layer in which, when the multimedia data is divided into 
5 a record unit and a reproduction unit, information of attributes of the 
record unit and a relationship between the record unit and the 
reproduction unit are described with a markup language. 

2. The medium of claim 1, wherein the multimedia data 

10 recorded on the first layer is video object data, still image data, or audio 
data. 



3. The medium of claim 1 , wherein when the multimedia data 
is video object data, the multimedia data is coded at a variable bit rate 
15 (VBR), and a reproduction time and linkage information of a position of 
reproduction data are described as temporal and position information, 
where the reproduction time and the linkage information are in a table 
format and recorded on the first layer. 

20 4. The medium of claim 3, wherein the record unit comprises 

a clip made by linking the video object data to the temporal and position 
information. 

5. The medium of claim 3, wherein the reproduction unit 
25 comprises a cell designating the record unit or a portion of the record 

unit 

6. The medium of claim 4, wherein the reproduction unit 
comprises a cell designating the record unit or a portion of the record 

30 unit. 
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7. The medium of claim 1 , wherein the reproduction unit has a 
hierarchical structure of at least two layers. 

8. The medium of claim 2, wherein the reproduction unit has a 
5 hierarchical structure of at least two layers. 

9. The medium of claim 7, wherein the reproduction unit has 
two layers comprising a cell, which designates the record unit or a 
portion of the record unit, and a chapter linked to a plurality of cells. 

10 

10. The medium of claim 9, wherein the reproduction unit has 
three layers comprising a cell, which designates the record unit or a 
portion of the record unit,, a chapter, and a title linked to the chapter. 

15 11. The medium of claim 1 , wherein the information described 

using the markup language comprises a clip, a cell, a chapter, and a title. 

1 2. The medium of claim 1 0, wherein the information described 
20 using the markup language comprises a clip, the cell, the chapter, and 

the title. 

1 3. The medium of claim 1 , further comprising: 

a third layer storing navigation data, which is used to control a 
25 selection of the reproduction unit and reproduction sequence. 

14. The medium of claim 9, further comprising: 

a third layer storing navigation data, which is used to control a 
selection of the reproduction unit and reproduction sequence. 

30 

1 5. The medium of claim 1 0, further comprising: 
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a third layer storing navigation data, which is used to control a 
selection of the reproduction unit and reproduction sequence. 

16. A multimedia data storage medium, comprising: 
5 a first layer storing multimedia data; 

a second layer in which, when the multimedia data is divided into 
a record unit and a reproduction unit, information of attributes of the 
record unit and a relationship between the record unit and the 
reproduction unit are described in a table format; and 
io a third layer storing navigation data of a selection of the 

reproduction unit and reproduction sequence. 

1 7. The medium of claim 16, wherein the navigation data 
stored in the third layer is a script language that is interpreted and 

15 executed in units of lines, and controls the selection of the reproduction 
unit and the reproduction sequence. 

1 8. The medium of claim 16, wherein the navigation data 
stored in the third layer is a markup language with timing and 

20 synchronization functions, and controls the selection of the reproduction 
unit and the reproduction sequence. 

19. The medium of claim 18, wherein presentation data is 
described with a markup language and stored in the third layer, and 

25 comprises information of a layout of a menu screen and a screen 
structure of the reproduction unit. 

20. The medium of claim 18, wherein the navigation data is an 
interface to select and control the reproduction unit recorded on the 

30 second layer. 
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21 . The medium of claim 18, wherein the navigation data 
comprises information of registration and execution of an event 
generating specific conditions when the reproduction unit is reproduced. 

5 22. A multimedia reproduction apparatus reproducing data 

stored in a multimedia data storage medium, comprising a first layer 
storing multimedia data, and a second layer describing information of 
attributes of a record unit and a relationship between the record unit and 
a reproduction unit using a markup language, wherein when the 

10 multimedia data is divided into the record unit and the reproduction unit, 
the information of the record unit and the reproduction unit is read from 
the second layer, a desired position of the data selected by a user, is 
detected that corresponds to the record unit, and the data is reproduced 
at the detected position. 

15 

23. The apparatus of claim 22, wherein a menu screen is 
provided to select the reproduction unit and to reproduce the data of the 
selected reproduction unit, when the data of the selected reproduction 
unit is stored by at least two layers. 

20 

24. The apparatus of claim 22, wherein navigation information 
of the selection of the reproduction unit and reproduction sequence is 
stored on a third layer, and the data of the selected reproduction unit is 
reproduced based on the navigation information. 

25 

25. A multimedia reproduction apparatus reproducing data 
stored in a multimedia data storage medium, comprising a first layer 
storing multimedia data, a second layer describing information of 
attributes of a record unit and a relationship between the record unit and 

30 a reproduction unit are stored in a table format when the multimedia data 
is divided into the record unit and the reproduction unit, and a third layer 
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comprising navigation information of a selection of the reproduction unit 
and a reproduction sequence, wherein a reproduction of the reproduction 
unit is based on the navigation information and a user selection. 

26. The apparatus of claim 25, wherein when the navigation 
5 information recorded on the third layer is described using a script 

language that is interpreted and executed in units of lines, a navigation is 
performed by executing the script language, where the navigation 
information controls a selection of the reproduction unit and the 
reproduction sequence. 

10 

27. The apparatus of claim 25, wherein when the navigation 
information recorded on the third layer is described using a markup 
language with timing and synchronization functions, a navigation is 
performed to carry out the timing and synchronization functions and the 

15 navigation information controls a selection of the reproduction unit and 
the reproduction sequence. 

28. The apparatus of claim 25, wherein when the storage 
medium comprises presentation data described using a markup 

20 language, the presentation data is read from the storage medium and 
output in a form of a menu screen representing a user interface. 

29. A multimedia data recording apparatus recording 
multimedia data on a storage medium, the apparatus recording 

25 multimedia data on a first layer of the storage medium, dividing the 
multimedia data into a record unit and a reproduction unit, recording 
information and linkage information of the record unit and the 
reproduction unit using markup language, and storing the information 
and linkage information on a second layer of the storage medium, 

30 wherein the multimedia data comprises video object data, still image data, 
audio data, or graphic data. 
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30. The apparatus of claim 29, wherein navigation data is 
recorded on a third layer of the storage medium to control a selection of 
the reproduction unit and a reproduction sequence. 

5 

31 . A multimedia data recording apparatus recording 
multimedia data in a storage medium, the apparatus recording 
multimedia data on a first layer of the storage medium, dividing the 
multimedia data recorded on the first layer into a record unit and a 

10 reproduction unit, recording information and linkage information of the 
record unit and the reproduction unit on a second layer in a table format, 
and storing navigation data in a third layer, wherein the multimedia data 
comprises video object data, still image data, audio data, or graphic data, 
and the navigation data controls a selection of the reproduction unit and 

15 a reproduction sequence. 

32. The apparatus of claim 31 , wherein the navigation data is 
described using a script language that is interpreted and executed in 
units of lines. 

20 

33. The apparatus of claim 31 , wherein the navigation data is 
described using a markup language with timing and synchronization 
functions. 

25 34. The apparatus of claim 31 , wherein presentation data is 

recorded on a third layer of the storage medium using a markup 
language, and the presentation data comprises a menu screen to select 
the reproduction unit or a screen layout to display the multimedia data. 

30 35. A multimedia data recording method recording multimedia 

data on a storage medium, comprising: 

storing multimedia data in a first layer; 
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dividing the multimedia data into a record unit and a reproduction 
unit; and 

providing a second layer describing with a markup language 
information of attributes of the record unit and a relationship between the 
5 record unit and the reproduction unit. 

36. A multimedia data recording method recording multimedia 
data on a storage medium, comprising: 

storing multimedia data in a first layer; 
io dividing the multimedia data into a record unit and a reproduction 

unit; 

providing a second layer describing in a table format information 
of attributes of the record unit and a relationship between the record unit 
and the reproduction unit; and 
15 storing in a third layer navigation data of a selection of the 

reproduction unit and reproduction sequence. 

37. A multimedia data recording method recording multimedia 
data on a storage medium, comprising: 

20 recording the multimedia data on a first layer of the storage 

medium; 

dividing the multimedia data into a record unit and a reproduction 

unit; 

recording information and linkage information of the record unit 
25 and the reproduction unit using markup language; and 

storing the information and linkage information on a second layer 
of the storage medium, wherein the multimedia data comprises video 
object data, still image data, audio data, or graphic data. 

30 38. A multimedia data recording method recording multimedia 

data in a storage medium, comprising: 
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recording the multimedia data on a first layer of the storage 
medium; 

dividing the multimedia data recorded on the first layer into a 
record unit and a reproduction unit; 

5 recording information and linkage information of the record unit 

and the reproduction unit on a second layer in a table format; and 
storing navigation data in a third layer, wherein the multimedia data 
comprises video object data, still image data, audio data, or graphic data, 
and the navigation data controls a selection of the reproduction unit and 

10 a reproduction sequence. 
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